Session 4: Speech I

نویسنده

  • Richard F. Lyon
چکیده

The first paper, "Field Test Evaluations and Optimizations of Speaker Independent Speech Recognition for Telephone Applications," by Gagnoulet and Sorin of CNET, was presented by Christel Sorin. This paper discussed various ways of improving system usability and performance by optimizing both the dialog ergonomy and the recognition technology within the constraints of low-cost real-time implementation. Techniques discussed included use of field data in training, increasing the number of parameters, automatic adjustments of the HMM structure, and better rejection procedures. A brief discussion of the rejection rate versus error rate tradeoff ensued; nobody had any good data or ideas on how to make this tradeoff, so when one person suggested that the rejection rate should be adjusted to keep the error rate under 5 percent, we said OK and moved on.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Session 7: Speech Recognition I

This session presented a number of interesting papers on a wide range of topics concerning speech recognition: two papers on noise robust signal processing algorithms, one paper on approaches to large vocabulary continuous speech recognition, two papers on algorithms for reducing computation time, one paper on adding new words to the vocabulary, and one paper on theoretical issues concerning vo...

متن کامل

Data Collection And Evaluation

This session focussed on two inter-related issues: (I) performance assessment for spoken language systems and (2) experience to date in speech corpora collection for these systems. The session included formal presentations from representatives of SRI International, MIT's Laboratory for Computer Science, BBN Systems and Technologies Corporation, and Carnegie Mellon University's School of Compute...

متن کامل

THURSDAY AFTERNOON, 21 MAY 2009 GRAND BALLROOM I, 1:00 TO 4:00 P.M. Session 4pSW Speech Workshop: Cross Language Speech Perception and Linguistic Experience: Poster Session A

Perceptual similarity between Tone 2 and Tone 3 in Mandarin was widely discussed in previous studies Moore and Jongman 1997 , Huang 2004 , Bent 2005 . Other tonal contrasts are hardly addressed. However, recent findings of Mandarin tones show that Tone 3 and Tone 4 are confusing in terms of descent slope. A big difference between previous studies and current ones is kinds of Tone 3 stimuli: pre...

متن کامل

9th ISCA Workshop on Speech Synthesis

s 10 Keynote Session 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 Oral Session 1: Prosody. . . . . . . . . . . . . . . . . . . . . . . . . 12 Poster Session 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 Keynote Session 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Oral Session 2: Deep Learning in Speech Synthesis . . . . . . . . . 25 Demo Session . . . ...

متن کامل

Note from the Editor: Special issue on speech processing and soft computing

This special issue of the Journal is devoted to the work of twelve eminent speech scientists who apply novel soft computing methods to address some of the most difficult and persistent problems facing speech recognition systems today. I first heard these scientists discuss their innovative soft computing algorithms at the University of Salamanca where the Sixth International Conference on Soft ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1991